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1 TTGCTGACTC ATGTGCCCGC AGCTAGCAGG AGCTGGCAGC ATGGGCTCCC 
51 CAGGGGCTAC GACAGGCTGG GGGCTTCTGG ATTATAAGAC GGAGAAGTGG 
101 GCTCTCCTCG CCAAAAAAGG CTACCAGGAG CGGGACCTGG AACCCCAGTT 
151 TTCCATCATC ACCAAACTCA AAGGGGTTTC CGTCACTCAG ATCAAGGAGC 
201 TTGGAAACCG GCTGTGGGAT GTGGCCGACT TCGTGAAGCC ACCTCAGGGA 
251 GAGAACGTGT TCTTCTTGGT GACCAACTTC CTTGTGACGC CAGCCCAAGT 
301 TCAGGGCAGA TGCCCAGAGC ACCCGTCCGT CCCACTGGCT AACTGCTGGG 
351 TCGACGAGGA CTGCCCCGAA GGGGAGGGAG GCACACACAG CCACGGTGTA 
401 AAAACAGGCC AGTGTGTGGT GTTCAATGGG ACCCACAGGA CCTGTGAGAT 
451 CTGGAGTTGG TGCCCAGTGG AGAGTGGCGT TGTGCCCTCG AGGCCCCTGC 
501 TGGCCCAGGC CCAGAACTTC ACACTGTTCA TCAAAAACAC AGTCACCTTC 
551 AGCAAGTTCA ACTTCTCTAA GTCCAATGCC TTGGAGACCT GGGACCCCAC 
601 CTATTTTAAG CACTGCCGCT ATGAACCACA ATTCAGCCCC TACTGTCCCG 
651 TGTTCCGCAT TGGGGACCTC GTGGCCAAGG CTGGAGGGAC CTTCGAGGAC 
701 CTGGCGTTGC TGGGTGGCTC TGTAGGCATC AGAGTTCACT GGGATTGTGA 
751 CCTGGACACC GGGGACTCTG GCTGCTGGCC TCACTACTCC TTCCAGCTGC 
801 AGGAGAAGAG CTACAACTTC AGGACAGCCA CTCACTGGTG GGAGCAACCG 
851 GGTGTGGAGG CCCGCACCCT GCTCAAGCTC TATGGAATCC GCTTCGACAT . 
901 CCTCGTCACC GGGCAGGCAG GGAAGTTCGG GCTCATCCCC ACGGCCGTCA 
951 CACTGGGCAC CGGGGCAGCT TGGCTGGGCG TGGTCACCTT TTTCTGTGAC 
1001 CTGCTACTGC TGTATGTGGA TAGAGAAGCC CATTTCTACT GGAGGACAAA 
1051 GTATGAGGAG GCCAAGGCCC CGAAAGCAAC CGCCAACTCT GTGTGGAGGG 
1101 AGCTGGCCCT TGCATCCCAA GCCCGACTGG CCGAGTGCCT CAGACGGAGC 
1151 TCAGCACCTG CACCCACGGC CACTGCTGCT GGGAGTCAGA CACAGACACC 
1201 AGGATGGCCC TGTCCAAGTT CTGACACCCA CTTGCCAACC CATTCCGGGA 
1251 GCCTGTAGCC GTTCCCTGCT GGTTGAGAGT TGGGGGCTGG GAAGGGCGGG 
1301 GCCCTGCCTG GGGATCTCAA GGATGAGGCC CCAGCATGGA GGATTGGGGG 
1351 TAGAATTCCA CCCTTGAACC CCAGCAGACA GTCCCTCCCC TGACTCCCAC 
14 01 CTTGGTAGGG TGCTGCCTCA GGGAGCCATA GAAGTCGGCT GTGTTTTGAG 
1451 ACGGCGACAG AACCTGACCC GTGGAGACTG GGAGAGCCCA GCAGGCACCT 
1501 GTATTGCAGG GCTCCGACTG CATGTGGCAG GGGCTCCTGC TGCGTCTGGG 
1551 CCTGGAGGTC TCTCTCCCAG TGCTCTGTCC CCAGTGTTCC TAGCAGAGGT 
1601 ATGCTTACCA GCTGTCAGCA CAGACCCTCC TGCTGCCTGG GTCCTGGCCC 
1651 TCCTCCCCCA TCTGCACCCC CATCATAGGT AGAGACCCCA CCCTCCCATC 
1701 GGTCCTACAT GGGGCTGTGC AGCTGGAGCC AAAAAGGCAA GGCAGAAAGA 
1751 GGAGTGATGG GGGAGGGGGA TTGTTTCAGC TTCTCTGGTG CTGTGATGCC 
1801 CCAGGAGAGT CCTAATCTAG GGAATGGGGT GGAGTAGGCA GATAATCCAC 
1851 CTCCCTATCC CCCAGGCAAG GGCGGAGCAT GTGTCTTGGG CCCACACCTG 
1901 CTTAGTTTAT GAGGACCGGC TGCTTTCCAG TGGTAGCCCT TTTGCCATGG 
1951 AGGTCTGGGA GAGAGAGCAG AGGGCGGCAG GGCTAAGTTG GTGATCATTG 
2001 GGTTCTTCAG GACCTTCTAT ATCCCTCCTC GGTAACCCCC CAGCCCAACC 
2051 CCTTGGAATC TTTCCTCCAG GCTTCCTGAG AGCCCTGGGG GTGGGAGGCT 
2101 GTGGGAGGCT GTACATCTGA AATTCACTTC AGTCCAAGTC ATACCTAGGA 
2151 AGCTGTCTGG GCAGCTGCTC GAGGGAGGCC CTGGCTCTGA TCCCAGGCTG 
2201 GATGGAGTGG CTGGAAGGAA TGGTTCCAAA CAACACCACC GAGATCTCCC 
2251 TCAGGCTGGC CAGGTTTTGC AGCTGGAATT CTCCTCTTGG TCCCAGGGCG 
2301 GGGCAGGGAA TTCTAAGTGT CCACCCCAGG GAGGCAAGGG GCTGCTTTCC 
2351 ACTGTGGGTA CCTGGTGATC AGGGCAAGCT GTGGAGGGCC AGGGGTGGGG 
2401 CTGAGACTGG GCTGACATCT AGAATCACCT GCCACCTGGA GCCTCAGTAA 
2451 AATGCCTGGG GTCCCTGCTG CCTCTCAATC TCCAGAGCCA TGTCCATGGG 
2501 GAGGTGGGCT CTGAAGGGCG AAGGTGGGAG AGCAGGGCCC CTGAGGCCTG 
2551 GGTATCCAAG GAGGGGCACG TGCACCTGAT TCTCCTTGGG GCCCAGAGGA 
2601 AGCTGATGTC ATGGCTGGAC AAAGTCACGG AGTAAAGCCA GCAAAGCCAC 
2651 CAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 
(SEQ ID NO: 1) 

FEATURES : 

5'UTR: 1 - 40 

Start Codon: 41 

Stop Codon: 1256 

3'UTR: 1259 
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HOMOLOGOUS PROTEIN: 

Top 10 BLAST Hits: g 

Sequences producing significant alignments: ^ttV J^ 1 ^ 

CRA 18000005098398 /altid-gi | 4885535 /def =ref | NPJJ05437 . 1 | pun... 857 0.0 

CRA 335001098681202 /altid=gi | 11417813 /def -ref | XPJ509854 . 1 | pu . . . 857 0.0 

CRA 1000682348238 /altid-gi | 6469324 /def =gb | AAf«303 . 1 | AF065385 . . . 855 0.0 

CRA 
CRA 
CRA 
CRA 
CRA 
CRA 



18000005129684 /altid=gi | 6754966 /def=ref 
18000005027891 /altid-gi | 6981322 /def=ref 
148000001425983 /altid=gi | 7920253 /def=gb 
18000005038217 /altid=gi | 7447773 /def=pir 



NP_035158.l| puri... 621 e-177 

NP_036853.1| p2X6. . . 604 e-172 

AAF70599.l|AF2050. . . 360 2e-98 

S71344 purinergi... 348 8e-95 



18000005027890 /altid=gi | 1709522 /def=sp| P51578 | P2X5 RAT P2 . . . 345 7e-94 
18000005064403 /altid=gi | 4505549 /def =ref | NP_002551 . 1 | pun... 318 9e-86 
18000005196095 /altid-gi | 4099121 /def =gb | AAD00553 . 1 | (U8399... 318 9e-86 



EST: 

Sequences producing significant alignments 
gi | 11617343 /dataset=dbest /taxon=96 . . . 
gi | 6992441 /dataset=dbest /taxon=960. . . 
gi | 4990980 /dataset=dbest /taxon=9606 ... 
gi | 10325489 /dataset=dbest /taxon=96 . . . 
gi | 2195075 /dataset=dbest /taxon=9606 ... 

EXPRESSION INFORMATION FOR MODULATORY USE: 

gi | 11617343 Brain- anaplastic oligodendroglioma 

gi | 6992441 Chronic lyphocytic leukemia 

gi | 4990980 Lung- carcinoid 

gi | 10325489 lung - large cell carcinoma 

gi | 2195075 Colon 

Tissue expression: 
Whole brain 



Score E 

(bits) Value 

1164 0.0 

648 0.0 

579 e-163 

464 e-128 

287 4e-75 
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1 MGSPGATTGW GLLDYKTEKW ALLAKKGYQE RDLEPQFSII TKLKGVSVTQ 

51 IKELGNRLWD VADFVKPPQG ENVFFLVTNF LVTPAQVQGR CPEHPSVPLA 

101 NCWVDEDCPE GEGGTHSHGV KTGQCWFNG THRTCEIWSW CPVESGWPS 

151 RPLLAQAQNF TLFIKNTVTF SKFNFSKSNA LETWDPTYFK HCRYEPQFSP 

201 YCPVFRIGDL VAKAGGTFED LALLGGSVGI RVHWDCDLDT GDSGCWPHYS 

251 FQLQEKSYNF RTATHWWEQP GVEARTLLKL YGIRFDILVT GQAGKFGLIP 

301 TAVTLGTGAA WLGWTFFCD LLLLYVDREA HFYWRTKYEE AKAPKATANS 

351 VWRELALASQ ARLAECLRRS SAPAPTATAA GSQTQTPGWP CPSSDTHLPT 

401 HSGSL {SEQ ID NO: 2) 

FEATURES : 

Functional domains and key regions: 
[1] PDOC00001 PS00001 ASN_GLYCOS YLAT I ON 
N-glycosylation site 

Number of matches: 3 

1 12 9-132 NGTH 

2 159-162 NFTL 

3 174-177 NFSK 



[2] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP- dependent protein kinase phosphorylation site 
368-371 RRSS 



[3] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches : 2 

1 17-19 TEK 

2 131-133 THR 



[4] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches: 2 

1 217-220 TFED 

2 336-339 TKYE 



[5] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 



Number of 


matches : 


10 


1 


2-7 


GSPGAT 


2 


5-10 


GATTGW 


3 


45-50 


GVSVTQ 


4 


113-118 


GGTHSH 


5 


119-124 


GVKTGQ 


6 


130-135 


GTHRTC 


7 


146-151 


GWPSR 


8 


225-230 


GGSVGI 


9 


297-302 


GLIPTA 


10 


306-311 


GTGAAW 



[6] PDOC00932 PS01212 P2X_RECEPTOR 
ATP P2X receptors signature 

225-251 GGSVGI RVHWDCDLDTGDSGCWPHYSF 

Membrane spanning structure and domains: 
Helix Begin End Score Certainity 

1 69 89 0.782 Putative 

2 299 319 1.835 Certain 



BLAST Alignment to Top Hit: 

>CRA| 18000005098398 /altid=gi | 4885535 /def =ref | NP_005437 . 1 1 

purinergic receptor P2X-like 1, orphan receptor; P2X 
specifically expressed in skeletal muscle; purinoceptor 
P2X6 [Homo sapiens] /org=Homo sapiens /taxon=9606 
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/dataset=nraa /length=431 
Length =431 

857 bits (2189), Expect =0.0 



Identities = 405/431 (93%), Positives = 405/431 (93%), Gaps = 26/431 (6%) 



Query: 1 MGS PGATTGWGLLDYKTEK WALLAKKGYQERDLE 34 

MGSPGATTGWGLLDYKTEK WALLAKKGYQERDLE 
Sbjct: 1 MGS PGATTGWGLLDYKTEKYVMTRNWRVGALQRLLQFG I WYWGWALLAKKGYQERDLE 60 

Query: 35 PQFS I ITKLKGVSVTQIKELGNRLWDVADFVKPPQGENVFFLVTNFLVTPAQVQGRCPEH 94 

PQFS I ITKLKGVS VTQIKELGNRLWDVADFVKPPQGENVFFLVTNFLVTPAQVQGRCPEH 
Sbjct: 61 PQFS I ITKLKGVSVTQI KELGNRLWDVADFVKPPQGENVFFLVTNFLVTPAQVQGRCPEH 120 

Query: 95 PSVPLANCWVDEDCPEGEGGTHSHGVKTGQCWFNGTHRTCEIWSWCPVESGWPSRPLL 154 

PSVPLANCWVPEDCPEGEGGTHSHGVKTGQCWFNGTHRTCEIWSWCPVESGWPSRPLL 
Sbjct: 121 PSVPLANCWVDEDCPEGEGGTHSHGVKTGQCVVFNGTHRTCEIWSWCPVESGVVPSRPLL 180 

Query: 155 AQAQNFTLFIKNTVTFSKFNFSKSNALETWDPTYFKHCRYEPQFSPYCPVFRIGDLVAKA 214 

AQAQNFTLFIKNTVTFSKFNFSKSNALETWDPTYFKHCRYEPQFSPYCPVFRIGDLVAKA 
Sbjct: 181 AQAQNFTLFIKNTWFSKFNFSKSNALETWDPTYFKHCRYEPQFSPYCPVFRIGDLVAKA 240 

Query: 215 GGTFEDLALLGGSVGIRVHWDCDLDTGDSGCWPHYSFQLQEKSYNFRTATHWWEQPGVEA 274 

GGT FEDLALLGG S VG I RVHWDCDLDTGD S GCW PHYS FQLQE KS YNFRTATHWWEQ PG VE A 
Sbjct: 241 GGT F EDLALLGGS VG IR VHWD CD LDTGD SGCW PHYS FQLQEKS YNFRTATHWWEQ PGVE A 300 

Query: 275 RTLLKLYGIRFDILVTGQAGKFGLIPTAVTLGTGAAWLGWTFFCDLLLLYVDREAHFYW 334 

RT LLKL YG I RFD I LVTGQAGKFG L I PTAVTLGTGAAWLG WT F FCDLLLL YVDREAH F YW 
Sbjct: 301 RTLLKLYG I RFD I LVTGQAGKFG L I PTAVTLGTGAAWLGWTFFCDLLLLYVDREAHFYW 360 

Query: 335 RTKYEEAKAPKATANSVWRELALASQARLAECLRRSSAPAPTATAAGSQTQTPGWPCPSS 394 

RTKYE E AKAP KATANS VWRE LALAS QARLAE CLRRS S APAPTATAAG SQTQT PGW P C P S S 
Sbjct: 361 RTKYEEAKAPKATANSVWRELALASQARLAECLRRSSAPAPTATAAGSQTQTPGWPCPSS 420 

Query: 395 DTHLPTHSGSL 405 

DTHLPTHSGSL 
Sbjct: 421 DTHLPTHSGSL 431 (SEQ ID NO : 4) 



Hxnmer search results (Pf am) : 

Scores for sequence family classification (score includes all domains) : 



Model 


Description 


Score 


E -value 


N 


CE00369 


E00369 P2X6_receptor 


1180.5 


0 


2 


PF00864 


ATP P2X receptor 


870.0 


7.4e-258 


1 


CE00207 


CE00207 PURINERGIC 


366.8 


5.9e-lll 


1 


CE00370 


E00370 P2X4_receptor 


336.8 


1.9e-109 


1 


CE00368 


E00368 P2X7_receptor 


124.1 


6.5e-36 


1 


PF00095 


WAP-type (Whey Acidic Protein) 1 four-disulf i 


8.7 


1.1 


1 


PF01841 


Transglutaminase- like super family 


6.0 


6.3 


1 


PF01368 


DHH family 


2.5 


6.8 


1 



Parsed for domains: 



Model 


Domain 


seq-f 


seq-t 


hmm-f 


hmm- 1 




score 


E -value 


CE00369 


1/2 


1 


19 [. 


1 


21 


[• 


36.3 


2.1e-ll 


PF00095 


1/1 


87 


Ill . . 


1 


40 


[. 


8.7 


1.1 


PF01841 


1/1 


120 


130 . . 


1 


11 


[. 


6.0 


6.3 


PF01368 


1/1 


221 


237 . . 


1 


19 


C 


2.5 


6.8 


CE00368 


1/1 


54 


299 . . 


85 


333 




124.1 


6.5e-36 


CE00370 


1/1 


20 


338 . . 


46 


372 




336.8 


1.9e-109 


CE00207 


1/1 


20 


345 . . 


47 


393 




366.8 


5.9e-lll 


CE00369 


2/2 


20 


351 . . 


48 


379 


*J 


1143.5 


0 


PF00864 


1/1 


20 


354 . . 


34 


395 


J 


870.0 


7.4e-258 
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1 TCTCCAAGTC CATGGGTGCC TGGTAGGAGA CAGGGGGATG AATGTGAACC 
51 CCTGCATGGC TATAGCCACC TGCCTCCTCC CCTGCCCTGC ATCACTACCT 
101 GGCCTATTTT TTGCCTCTAG AAGCACTGCT TCCTATGCTC CTTAGGACCA 
151 CTGCCCGCAT ATGACAGATA AGAACATCGA GGCTAAGGCA ACGCAAATCT 
201 TTTCCTTAAA GTCATACAGC TGTCAAAAGA AAGCTGGACA ACCTGGGCAA 
251 CATAGCGAGA TAAAAAATTA TTTAAATTAG CCAGATGTGG TAGCCCCCTG 
301 TAGTCTCAGC GACTCAGGAG GCTGAGGCAG GAGGCTCACC AGAGTGCAGA 
351 GTTCAAGGAT GCAGTGAGCT ATGATCCTGC CACTGCACTG AAAGCTGGGT 
401 GACAGAGCAA GACCCTGGCT CTAATAAATG AATACATAAA GTCTCACAGC 
451 TAGTGGTAGC TAATCCTGCC AGAGTCAGGC CTCTACCTGT CTGATGACAA 
501 ATGGCACACT ATGTCTTTTA ACCTGATTGC AGACCACAAA TGTTTTGTGA 
551 ATATTTTCCC CAGGGAAAAA ACCGGAAGTA GTTCTAAATT CTATACATCC 
601 ATTATATTAG TTTTACCTGT GGATTGGGAA AACCCAGCTC TGATTGCATT 
651 TCAGGGCGGG ACAGCCTTTG GTGCACTGTC TGGCGGGATT TTCCATTTTA 
701 ACCTCCTTCT AGAAGCGCCT TCTCATGGTA AAGTTCCTGA TGCCGCCAGG 
751 AGCGCCGAGG AGAGGGCAGG GGGCTGGAGA CGCCCCGCAG AGGGCTACGT 
801 GCCCTGCTGG ACAGAGGTCT CCTGCCTCCT CGGCGGCGCC AGCCCACCTC 
851 CCACAACCCC TGCGGGAGAA GCCCCCAAGG GGAGGAGACG GGCCTGGCCC 
901 CTGCCCCGAG CACCTTCCGT CTCTAGGTCG GAGTCTGAAT CGGCCTTGGG 
951 ACCCTGCTTG GCTTCGGGGA CCCCTGCAAG ACGTCCACAG GCCGCCGTCG 
1001 CCTCTTCCTC CTGCTTTTTA TCCTCCCCAG ACCTCTGGCA GGAACCGCTC 
1051 ATCGTTACGC CCCTTTCGCA GCCTCAGACC CTGAGGCGGA GACCGCTTGG 
1101 CGCCTCACTT AGAGCGCGAC CCGGGGATGT GGGCGGAGTC TGCGGCTGCG 
1151 CTGACCAATC GAGTGTGGCG TCCATCGACT GGCGTCTGCC ACGGCAATTA 
1201 GCGACGCGCT CCCCCGCGGC GGTCGCCCCG GCAACCCAGT GCTGTAGGTT. 
1251 GCCGTAGAAA CCGTGGCTCT CCTGCGCTGA GGCTCCTCGC CTGAGAGGAT 
1301 AAACTGCACG CGCCACGGGC TATGCACTGG GCTGGGCGCC TTGTGGGCAT 
1351 CCTCCCTGCC TTCCTAGGGG GTTCCAGCAT CGCCCCCCTT TCGTGGACTG 
1401 GGAAACACGC CTGACTCCAG GACTTGTGTT GTCCTCACTG CACTGGGGAA 
1451 GGTGGCGGGG GCAGCTTTTC AGGAGGGCCT GGGGAACTTC GCAGAGCCAG 
1501 GTCACCCTCT CACTCTGTGC CTCTTAGTTA TCTTGCATGC TCTGGTCTTT 
1551 GCATACGCTG CTCCCTGCAC CAGGAACCTC CATCCCCATC TTTGTCTGCT 
1601 TGTCGAACTT CAGAAATCTG CAAGGGTCAG CTTAGAGGTC ACTTCTTCCG 
1651 GAAGCTTTCC TCAACACCCT CCCCGCCCTG CTGCTGCTGC CCTCAGGCCC 
1701 TCCTCTCACA GCACTGATAA CAGCTGTCCG TCTCCACCCT CCCACCACCT 
1751 CCACTCCCAC CCCAGGAAGT GAGGCCAGAG GGCAGGGACA GAGCTGCTGC 
1801 TGTTCTCTGT GTGCCAGGGC CCAGCAAAGG GAATGTAGGG AGGGTGGGAG 
1851 GTGCAGGGCA GCTGGGATTA GGGGTTGAGG GCTGGGTGTT GGAGGCTGGA 
1901 TCTGGATCCT GCTTTAGTGG AAGTGTCCCT TTAACAGCAA CTGGCCTGGC 
1951 CTGGCTCGGG CCCTGCTTTG CCTCCTGTTC AGCTGCGGCT GCAGCTGCCA 
2001 TGCTGACTCA TGTGCCCGCA GCTAGCAGGA GCTGGCAGCA TGGGCTCCCC 
2051 AGGGGCTACG ACAGGCTGGG GGCTTCTGGA TTATAAGACG GAGAAGTATG 
2101 TGATGACCAG GAACTGGCGG GTGGGCGCCC TGCAGAGGCT GCTGCAGTTT 
2151 GGGATCGTGG TCTATGTGGT AGGGTAAGAG AGAAGAGCTT TTGGCCAGGC 
2201 TGGAGGGGCA AGGGAAGAGG TGGGGGGTGG GGCTTGGTCC TGCTGGGTTG 
2251 AAGTTGAGGG TTGGGCTGTT TAGGGGCTGG AGTGGAAGGG GGCAGATTGG 
2301 GACGGGGTTG GGGAGAGCTA GGCGATACAA GACAGGAGAG CAAGAACAAG 
2351 CTGTGTGTTT GTCCTGTGTG TCCACTTGCC TCCTTCCCAG GCCCCCACCC 
2401 AGGCCCCACC CAGGGGGCAC ATGACATAGT CCTTAACATC TGTGAGAGCT 
2451 GGAGCACTAG GCCCCCAGAG AGACCACCAG CTGTATCTCG GGTCAGGAGA 
2501 GTCTGTAAGG GGGAAGCTGG ATCTAGTCAG GCTGGGGGTG GGTGCTGGCT 
2551 AGTGAAGGTG ATTGTCTGAG GGCATTGGCT CTCTGATGCA TGGCTGGAGC 
2601 TTCTGTCTCA TTCAGGGGGT CTGGAGTGGG AAGTGGGGCC AGAGAGGAGG 
2651 TGGGGCCTTC GATGTTGGGC CGGGAGCCTG TAGGGTGTGG GGGGAGAACT 
2701 GAGCATGTAG GGCTCAGCTC CGCCCCTGTC ACTACACGCT GGGGACACAC 
2751 CACACTGCCC GACTTCTCCT CCCCAGGTGG GCTCTCCTCG CCAAAAAAGG 
2801 CTACCAGGAG CGGGACCTGG AACCCCAGTT TTCCATCATC ACCAAACTCA 
2851 AAGGGGTTTC CGTCACTCAG ATCAAGGAGC TTGGAAACCG GCTGTGGGAT 
2901 GTGGCCGACT TCGTGAAGCC ACCTCAGGTG GGGGCCCTGA TGTTGCTGAC 
2951 GGGGGCGCAA GTCCTTTCCC CACTGACAGC CTGAACACCC GCCATGCAGC 
3001 CAGTGTGTGC GAGAGAGAAG CATGTGATGC CAGAGACGGC TGCGGGTTCT 
3051 CAGGAAGGGC TTCACAGAGG AGTGGCACCT GGACAGGACT TTCAGGGATG 
3101 TGTAGGAGGT TTTGGGGTGG AAAAAGGGGC CACTCAAGAA GCCAGGCCAG 
3151 GGTTGGACGT GCTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA 
3201 GGCAGGTGGA TCACGAGATT GAGAGTATCC TGGCTAACAC GGTGAAACCC 
3251 CATCTCTATT AAAAATACAA AAAATTAGCC GGGCATGGTG GTGGGCGCCT 
3301 GTAGTCCCAG CTACTCGGGA GGCTGGGGCA GGAGAATGGC ATGAACCCGG 
3351 GAGGTGGAGC TTGCAGTGAG CCGAGATTGC ACCACTGCAC TCCAGCCTGG 
3401 GTGGCAAAGC GAGACTCTGT CTCAAAAAAA AAAAAAAAAA GCCAGGCCAG 
3451 AGAAACTGCA TTTCCAAAGA CTGCCAACAG AAAAGAAGGG AGTGTCCAGG 
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3501 ACTAATGGCT TGAGCTTGAG AGTGGTGTGA GGTGCTGGGG CATGGAACTT 
3551 CCCTGTAGCC CTGCTCCCTG ACCTGGGGCA CTACGGTCAG GTGCTGCTCC 
3601 TCCCCTCTTC TCGGCTGCGT TTTCCTCCTC CCTCCACCCA GCTCATCCCC 
3651 AGCCTCAACT GCCACTTCTG CTCCTCTGAT GCCCAGGGTG TATTCCCAGT 
3701 GATCACCTGC CCAGAGCACA GCTGTCTTCT AGGTGCACAC CCACATGTCC 
3751 AAAGATCAAT TATTTTCCTC TCCTGGCATG GCCTCTGTGA CGCCCACTAG 
3801 TCATGGTGGC TGTGACATCC ACTAGTGCCT CAGCCAGACC CGTGACTCAC 
3851 CCTGGACCCC TTCCTGTCCC TTCCAAGATT TTTCACCACT ACCCATGCCA 
3901 TGCCATGCAT GAGACTATGG CCTCCTAGAG GGTCCCTAGA TGCCCCTCTC 
3951 GCCTCCTCCC TTACTGCTCG GTGCACACCA CGCAGCAGCC AAGCTGAACT 
4001 TTCACACCAG GCATCATGAG AGCCTGCAGC GCCTGCTTCT ACCCTCAGGA 
4051 ATTCCCCCAA CCCTGCCCAT GACGGTGTCC ACACTTTCCT CCCAATCCTA 
4101 ATGGCTGCCA CTCCCAGCAC CATCTGGCCA GCCCTCACCT TCCCTTCCTG 
4151 GGCATACATT CCCCAAATTC ACAGTGCTCT CACGAGCAGC ACTGGAGGGT 
4201 CAGCCTTTCT TTCCAATGTC CTCGGCCACC CGTTGACCAC AGACACAGCT 
4251 TTCCCTCTTC TCCCTTGGCC CCTGCCATGC CAGTGCTGCT GTGTGTGAGA 
4301 TGGGAGACTC ACCTCGTCTC CATCCTGAGC AGGTGCTGGG CCCAGCTCTC 
4351 CCTTGGATCT TCAGTACTAG AAGCAGCAGG CTGTTGGAAT ATTCTGGTTG 
44 01 GAGCCAGGCA TGGTAGCTGG AGCCTGTAGT CCCAGCTACT TGGGAGGCTG 
4451 AGGCAGGAGG ACCTCTTGAG TCCAGGAGTT AGAGGTTGCA GTGAGCACTG 
4501 ATCACAACAC TACACTCCAG CCTGGGTGAC GAAGTGTAAT CCTGTCTCTA 
4551 AATACACACA TACACATGCA CACACACACA CAAATTTTGG TTGAGACAAG 
4601 AGACTTGTCT CAAGAGATGG ACATGGGCAC AAGGCTTCCT GGTCTCAAAA 
4651 ATGGCCAGAA CCACTGCCAG CCTCCCATCT CTGCTTCAGT CTGCCTTACA 
4701 GGGGGACAGG GTTAATGACT TGATGGGGCC AACATCCCTT CCCTCATAAA 
4751 CCAGGCTGCC GGCTTCCGGC CTTTCCAGTC AACACGAGCC CAGCCAGGCC 
4801 AACCTTGAGA CTTGCCTCCT AGGGAGAGAA CGTGTTCTTC TTGGTGACCA 
4851 ACTTCCTTGT GACGCCAGCC CAAGTTCAGG GCAGATGCCC AGAGGTGAGT 
4 901 TTACCCAGGA TCCTCCCAGC GGGTCCCTTG TTCCTCCATC AGCCCCAGGT 
4951 GGCCACCCGT GTTTCCCTTT CCCCTTCCCA GGTGGCTGAA GGCTCAGCCT 
5001 GTGCTCGGTG TCCCCCAGGC ACTGGGCTAC ATCTTTTCCT GAATCATTAT 
5051 GTTCAGTCTT CACATATCCC CTGCCTGGTA GGAAGTCCTG TGATCCCCAT 
5101 TTCAGAGGAG AAGACTGAGG CTCAGTGAGG TTGAGTCACT TTCTTAAGGC 
5151 CTCCAGGCCT GTGGGTGACA GGACCCCGAG CTCTGGGCAG CAGCAGTTCC 
52 01 CATGAGGTGT CCAGGCCCTC CCATCCTGGT CCTGCCTCTG GGTACTCTCC 
5251 AGGTTGGTAG TGTGACACCC AGAGCTGCGC ACATGCTCAG GGAGGTTCTA 
5301 ATAGCAAGAG CCAAGCTGGA ATATCACCTC CCCTTGTCTG TGCCCAGCCT 
5351 CTATTAATAT GTCCTGAGGC AGCTTTCATC TTTGTGGGCC AACACAGCAC 
54 01 ACTCTTGCTC ATGGTGAATT CAGGATTGCT TATGATTTCT GGATAGTTTT 
5451 TTTTGTTTTA TTTTTGAGAC GGAGTTTCAC TCTGTCACCC ACGCTGGAGT 
5501 GCAGTGGCAG ATATCAGCTC ACTGCAAGCT CTGCCTCTCA GGTTCACGCC 
5551 ATTCTCCTGC CTCAGCCTCC GGAGTAGCTG GTACTACAGG CGCCTGCCAC 
5601 CACGCCCAGC TAATTTTTTT TTTTTTTTGT ATTTTTAGTG GAGACGGGGT 
5651 TTCACGGTGT TAGCCAGGAT GGTCTCCATC TCCTGACCTC ATGATCCACC 
5701 TGCCTCGGCC TCCCAAAGTG CTGGGATTAC AGGCGTGAGC CACCACGCCC 
5751 GGCCTGATTT CTGGATAGTT TTTACATCAA CCGTGGTCAA GCCAGAGTCC 
5801 CCCACCTTGT TCTTCTTCAT TTCTGATCCA GAAATGCTGA TTCTCCCCCT 
5851 GACATTTCAC CTTTTCCCCT TGCCTGGGGA TGTCCCTGGG ATCCTGCATC 
5901 TGTCACAGAG CATGCTCATT CTCTCCAGCT GTGAATTTTG TTTGAACTAT 
5951 TGGGACTCAG GACATAGTCC TGAAAGTTTA CCTCCACAGT GACATCTTTA 
6001 GGCAAGTCCA ACATTTACGT GCCTCCTGGG CTGGAGGGTC GTTGTGCAGA 
6051 CAGCTGTCCC CTGAGCCCTG GTGGCTGGTC CTAGCACAGT TGCTGGAGAC 
6101 ATCCCATGTC CGTAGTTGGA AATATGCACA AAGGATTGCT TACTCTTTTT 
6151 GTTTGTTTGT TTTTTTGAGA TGGAGTCTTG CTCTTGTCCC CAAGGCTGGA 
6201 GTTCAATGGC ACGATCTCGG CTCACTGCAA CCTCCGCCTC CTGGGTTCAA 
6251 GCAGTTCTCC TGCTCACCCC CTGAGTAGCT GGGATTACAG GTGCCCGCCA 
6301 CTGTGCCCAG CTAATTTTTG TATTTTAAGT AGAGACGGGG TTTCACCATG 
6351 TTGGCCAGGC TGGTCTCGAA CTCCTGGCCT CAGGTGACCC ACCAGCCTCG 
64 01 GCCTCTCAAA GTGCTGGGAT TACAGGCGTG AGCCTGCCGA GAGCTTGGTC 
6451 GGGGAGACCT GAACCCAGCG GTGCTAAAGG AATTAAAGAC AAACACACAT 
6501 AAATATAGAG GTGTGGAGTG GGAAATCAGG GGTATCACAG CCTTCAGAGC 
6551 TGACAGCCTC GAACAGATTT ACCCACATAT TTATTGACAG CAAGCCAGTG 
6601 ATAAGCATTG TTTCTACCAG ATTATAGATT AACTAAAAGT ATTCCTTATG 
6651 GGAAACAAAG GGATGGGCTC TGGTTGGTTA TCTGCAGCAG GAGCATGTCC 
6701 TTAAATCACA GATCGCTCAT GCTATTGTTT GTGGTTTAAG AACGCCTTTA 
6751 AGCGGTTTTC CGCCCTGGGT GGGCCAGGTT TTCCTTGCCC TCATTCCGGT 
6801 AAACCCACAA ACTTCCAGTG TGGGTGTCGT GGCTATCACA AACATGTCAC 
6851 AGTGCTGCAG AGATTTTGTT TATGGCCAGA TTTTGGGGGC CTCTTCCCAA 
6901 CATGAGCCAC TGTGCCTGGC AGGATGTGCT TACTCTTGGT GAACCCACAC 
6951 AATGTCCTTC TCTTTCTTAA TGCTCAGATG TGCATTTAGT GTTCAGTTTG 
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7001 TAGACCGTTC TGAAATTTGG CTGGATCTGT GGGTCTGTGT TTTTCAGAAT 
7051 CTGTGCAATT CCTCTTTGTC TGCAACCACA CTTCTGGCTC TTCCCATGAA 
7101 ACGTCAGGGC TGGGTCGTAA TTATCAGATC TGACAACCTG GCTTTCCCGG 
7151 AAGACCAGAG TTCTGCCAGC TCCTCTAGGG ATCCTGGTGC CTGATCCCTC 
7201 CCTTACATGC ACCATGCTCT TTATAGTGTC ACCTCCCTCA GCAGACACCG 
7251 CTGAGCCTCC CCGCTGGGCC AGGGGGCTAG CTAGGCTAAA TTCACAAAAC 
7301 TCCATCTCCC ATACTTCAAA GACCACCCAC ATGGACAGCC CAGCCCAGGT 
7351 GGCAGGTCCG ATGATGGGAC AGAGGCTGTA GGTGGGGGAC CTAGGGCTGC 
7401 ACTTGAGCAG AATCTTTTTT TTTTTTTTCT TTTTTTTTTT TTTGAGACAG 
7451 AGTCTCGCTC TGTCACCCAG GCTGGAGTGC AGTGGCGTGA TCTCGGCTCA 
7501 CTGCACACCT CCACCTCCTT GGTTCAAGCG ATTCTCCTGC CTCAGCCTCC 
7551 CAAGTAGGTG GGACTACAGG CACACACCAC CACACTCGGC TAATTTTTGT 
7601 ATTTTTAATA GAGACAGGGT TTTGCTGTGT CGGCCAGGCT GGTCTCGAAC 
7651 TCCTGACCTC AGGTAATCCG CCCACCTTGG CTTCTCAAAG TGTTGGGATT 
7701 ACAGGTGTGC CAGGCCAAGC AGAATCTTAA AAAAAGGTGG GGAGAAGCTG 
7751 GTGAGCAGGT GGATTTGGTT GAAGCAGGAT GTCGACACAG AGGGGGCTTG 
7801 GTGGGTAAAG GCCCTGAGCT GTGTGAGGTG AGGTGCCTTT AGGGCTACCT 
7851 GCCACTGGGT GGAGCTGAAG TGAAGATTTG GACTGGGGTG GGAAGAAGGT 
7901 AGTTCAGGAT TTCAGGGGCC CCTGTAAGCC CCACTAAGGA GCTAAACTGT 
7951 TTTTGTTTGT TTGTTTTCTT TTTCTCTTTT CTTTTTTTTC CTGTAGCAAT 
8001 GAGGTCTTGC TTTGTTGCCC AGGCTGGTCT CGAACTCCTG AGCTCAGGCA 
8051 ATCCGCCTAC TTTGGACTCT CAAAGTGCTA GGATTACAGG CGTGAGCCAC 
8101 TGTGCCTGGC AGGAGCTAAA CTTGATTAGA GGAACAGAAG AGAGCCACAC 
8151 GTGGGCTCAG AGGCAGGGTG CTCAGTTTCC TGCACATTGG GATGCACCAC 
8201 TTGGGCTGCT GGGCATAGGT GGATGAGGGT ATGGGAAGAC GTGGGGGCCC 
8251 CACTGGTGGT CACTGTGGGG TCTAGTTGGA GGAGACGGTA GCCCAGCTGG 
8301 GGTGAAGAGG AGAGGCAGAC ACAGGACATA GGTAGGGACA AAGAAGCAGA 
8351 GCATGTGGCT CTGCTCCGAC CTCCACCCAA TCACGACGGC CCTGTCTTTC 
8401 AGAAAGTCCC ACCGCCTCAT TCTGGCTTCT CAGAGGCCCT CAGCCTTCCT 
8451 TGCGCCCCTG GTGCTGGTGT TCTTCCTGCT GCCCCTGAGC TGAGTGCCCT 
8501 GGGCAGCAGT GTCCATCCTC AGTTGGGGCA GGACCATGCC TGGGAGAGTG 
8551 CCCGATGCTC AAGGGTGCCT TCGTCTCTGG GGTCTGGGAC CCCAGAAAGC 
8601 TCACCTGTCC TCCCCTTCTG CCAGAGCCCC ATAGTCCCAT GCCTCTGTGC 
8651 AGGCATTAAT GTCCCCAGGT TACAGAAGAG CGAGCAGGAA GGAGTAGCCT 
8701 GTGGTCCCTC AGCAAGGGTG TGGGGTCCTG CTTCAATACC CAAGCCCCTG 
8751 ACTCTAGGGC CCTGATCTTT GTCAGCTATG TCCCCATGCC GGGCATCAAA 
8801 AACTCACCCT CCCAAGGTAT CTTCACCTTC CCTGATCTGT CATCCAAATT 
8851 GGACCAGAGG AGCTAGACCT GGAAGAATCA CTTCCGCATC CACCAGGGAC 
8901 AGAACTGTCA GGAGGGAAGG GGCAGGGTGC GTTGTCTCAC GCCTGTAATC 
8951 CCAGCACTCT GGGAGGCTGA GACAGAAGGA TTGCTTGAGG CCAGGAGTTA 
9001 AAAACCAGCC TGGTCAACAT AGCAAGACTC CATCTCTACA AAAAAAAAAT 
9051 ATTAAAAAAT CAGCCAGGCA CAGTGGTGTG TGTCTGTAGT CCCAGCTACT 
9101 GGGAATACTG AGGTGAGAGG ATTGCTTAAG CCCGGGAGGG CGAGGCTGTA 
9151 GTGAGCCATG ATCATACCAC TGCACTAGAG CCTGGACAAC AGAGTGAGAC 
9201 CGAATCACTA AAAATAAATT TTTTGAAAAA GGAGGAAAGG GGTCTCCCTT 
9251 TGTCTTTGAA ATACAGTACT GTACCTTCAT CTGGCCAGGG CATTGCTCCG 
9301 CTCCCTCCTC TGACCACCTC CTTTTATTTG CACCCTCCAG CTTTCCTGTG 
9351 TGGCCCCACA CTCAGGGTAC TCTGGCGGCG GGGTGGTGAG GTTGTTTAAG 
9401 GTGGGAAGGG GGCCTGTCCT TCCCACCTTG AACCTCCCTG CCTTTGAGAC 
9451 TGGGCTGTGG AGGGGAGACA TCCCCTGTGC CATTGGTGAC TGCTCTCTCT 
9501 CCCACCTCAG CACCCGTCCG TCCCACTGGC TAACTGCTGG GTCGACGAGG 
9551 ACTGCCCCGA AGGGGAGGGA GGCACACACA GCCACGGTAA CTGTGGGCTC 
9601 TGTCTTCCAG TGCCCCTAGC AGGGTGGGGG CCGGGCTGGG ATCCTGGGTG 
9651 GCTCCTGAGT GCAGGCCCTG CTCGCCTCTG TCCCTGCATC TCTCTTTCTG 
9701 CCAACAACCC CCTGGCTGAA GGCCTCCCCA GGCCTGCAGA GATTTGAAGG 
9751 TCTGGAGTTC ATCTTTTGTT TTCTAGGTGT AAAAACAGGC CAGTGTGTGG 
9801 TGTTCAATGG GACCCACAGG ACCTGTGAGA TCTGGAGTTG GTGCCCCGTG 
9851 GAGAGTGGCG TTGTGCCCTC GTAAGTGTCC CCACAATCCC CTACCCCAAC 
9901 TGGCGCAGGG CCCCAGGCCT GGCAGAGGCT GTCACCTCCC TTCCACCTGC 
9951 AGGAGGCCCC TGCTGGCCCA GGCCCAGAAC TTCACACTGT TCATCAAAAA 
10001 CACAGTCACC TTCAGCAAGT TCAACTTCTC TAAGTAAGCA GAGTGGGTCT 
10051 CATCTGCCCC AAGACCCTCC TTGTCCCCTA CCTCATCTGA CCTTTCCCAC 
10101 TCCTCCCAGG TCCAATGCCT TGGAGACCTG GGACCCCACC TATTTTAAGC 
10151 ACTGCCGCTA TGAACCACAA TTCAGCCCCT ACTGTCCCGT GTTCCGCATT 
10201 GGGGACCTCG TGGCCAAGGC TGGAGGGACC TTCGAGGACC TGGCGTTGCT 
10251 GGTGGGTCCC AAGTTGGGGG CAGGGTTCCT AGAGGGCTCT GGGAGAGGGT 
10301 CCCGGGCCCA CCCACCGGTG GAAAAGCTAT GTGCTATGTG CAGGGTGGCT 
10351 CTGTAGGCAT CAGAGTTCAC TGGGATTGTG ACCTGGACAC CGGGGACTCT 
10401 GGCTGCTGGC CTCACTACTC CTTCCAGCTG CAGGAGAAGA GCTACAACTT 
10451 CAGGTGAGGC CCCACTGCTC CCAGTGCCCA GCTGCTGGGC CCATCGCCCT 
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CTCACTGTGG 
TTCCACTACG 
ACCCCACATC 
CCTCCACCCC 
CAGCCCTTCC 
TGGGGTGTTT 
TTTTTTTAAG 
CCGACCTCGG 
TTCCTCAGCC 
AGCTAATTTT 
CAGGCTGGTC 
CCGAAGGGCT 
GCTTTTACCT 
GCCCAGGCTG 
TCCCGTGTTC 
ACGGGTGCAC 
AGGGTTTCAC 
CCACCTGCCT 
CGCCCGACCT 
CTGGAATGCA 
TTCAGGTGAT 
TGCACCACCA 
TGCCATGTTG 
CACTTCAGCC 
GTTTTTTTTG 
GTTGCCCAGA 
GCCTCTCGGG 
ATTACAGGTG 
GATGGGGTTT 
GTGATCCACC 
CACTGTGCCT 
GAGTCTTCTA 
GTCATGCAGG 
AACATAGCTC 
CCTTCGAAAC 
CTCTGTGCTG 
TGTCCCCTCC 
TCTCTGGAAT 
GGGTCTGTTT 
CTTCTGAGGG 
ATAGGGCTGA 
ATTGACAAGT 
GCAAAGCAGG 
CTGGTGGGAG 
GAATCCGCTT 
CAGGCCGGGG 
GCTGACAGTC 
AAGTTCGGGC 
GCTGGGCGTG 
TGCTGCTGAC 
ACATTGAGTC 
GGGGCTGGAA 
GACCTGCTAC 
AAAGTATGAG 
CCACACTTAG 
CAGGCCAAGG 
CCTTGCATCC 
CTGCACCCAC 
CCCTGTCCAA 
GCCGTTCCCT 
CTGGGGATCT 
CCACCCTTGA 
GGGTGCTGCC 
CAGAACCTGA 
AGGGCTCCGA 
GTCTCTCTCC 
CCAGCTGTCA 
CCATCTGCAC 
CATGGGGCTG 
TGGGGGAGGG 



Title: 

CGGCCAGGAC 
TGTGCAAGGG 
CTCCAGCACA 
ATCTGTCCCA 
TCCACCCCAC 
CATTTTTACT 
ACAGAATCTC 
CTCACTGTAA 
TCCTGAGTAG 
TATATTTTGG 
TCAAACCCCT 
AGGATTACAG 
AAACTTTTTT 
GAGTACAGTG 
ACGCCATTCT 
GCCTCCACGC 
CATGTTGGCC 
CAGCCTCCCA 
TTTTTTTTGA 
ATGGCGTGGT 
TTTCCAGCCT 
CACCCAACTA 
GCCAGGCTGG 
TCCCAAAGTG 

CTGGAGTGCA 
TTCAAGCAAT 
CACACCACCA 
CACCATATTG 
TGCCTCAGCC 
GGCCTCAAGT 
AGCAGATCTA 
CAGGCACACA 
ACTGCACCCC 
CACGTGGGGC 
GCAAACCACC 
AGGAAGTCTA 
CCCTCAGCCC 
AGGGACTCCT 
ACAGGTCTTC 
GACCCAGGAC 
TGGACAGAGG 
TCACCAGAGC 
CAACCGGGTG 
CGACATCCTC 
ATGGGATGGG 
GTGGGCTGAG 
TCATCCCCAC 
GTGAGTGCGA 
CAGGGTGTGT 
TCGGGGTGCA 
CATGGGTTGG 
TGCTGTATGT 
GAGGTGAGCT 
GAAGATGTTG 
CCCCGAAAGC 
CAAGCCCGAC 
GGCCACTGCT 
GTTCTGACAC 
GCTGGTTGAG 
CAAGGATGAG 
ACCCCAGCAG 
TCAGGGAGCC 
CCCGTGGAGA 
CTGCATGTGG 
CAGTGCTCTG 
GCACAGACCC 
CCCCATCATA 
TGCAGCTGGA 
GGATTGTTTC 
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AGACCACACC 
GGTCCCAGGA 
GGCTCCGTCC 
GGCCCCTGCC 
CTCGCTTCTA 
TTCCCCTTCT 
ATTCTGTCAC 
CCTCTGCTTC 
CTGGAATTAC 
TAGATAGAGA 
GACCTCAGGT 
ACGTAAACCA 
TTTTTTTTTG 
GCGGGATCTC 
CCTGCCTCAG 
CCGGCTAATT 
AGGATGGTCT 
TAGTGCTGGG 
AACGGAGTTT 
CTTGGCTCAC 
CTGCCTCCAG 
ATTTTTGTAT 
TCTCGAACTT 
CTGGGATTAC 

GTGGCCCGAT 
TCTCCTGCCT 
CACCCAGCTA 
GTCAGGCTGG 
TCCCAAAGTG 
TTCATAAATT 
TTGGATCCTT 
CGACCACCAG 
GTCTGATCTG 
TGGCTCCCAC 
TGTGTGCATG 
CCCCAGTTCC 
TGAGCCTGCC 
CTGTCCCCCG 
GTTTGCTTTT 
TCAGGCTTGG 
TCTGGCAGGG 
CTTCTTTCCT 
TGGAGGCCCG 
GTCACCGGGC 
GCAGGCAGAC 
AGGTTCAGCT 
GGCCGTCACA 
GCACTGTGGG 
CCAATGCATG 
GGCTGGGGAG 
CCCTGCCTCT 
GGATAGAGAA 
GAGGTCGCTC 
GCTGGATCCC 
AACCGCCAAC 
TGGCCGAGTG 
GCTGGGAGTC 
CCACTTGCCA 
AGTTGGGGGC 
GCCCCAGCAT 
ACAGTCCCTC 
ATAGAAGTCG 
CTGGGAGAGC 
CAGGGGCTCC 
TCCCCAGTGT 
TCCTGCTGCC 
GGTAGAGACC 
GCCAAAAAGG 
AGCTTCTCTG 



CAGGCCCAGG 
GCAGGAGAGA 
TGCTGCCCCA 
CAGCTCAGGC 
GTATCTCCCC 
CCCCTTCAGC 
CCAGGCTGGA 
CTGGGTTCAA 
AGGTGCTCGC 
TGGGTTTTCA 
GATCCACCCA 
CCATGTCTGG 
AGATGGAGTC 
AGCTCACTGC 
CCTCCCAAGT 
TTTGCATTTT 
CGATCTCTTG 
ATTACAGGCG 
TCACTTTCTT 
TGCAACCTCT 
AGTAGCTGGG 
TTTTAGTAGA 
CTGACCTCAG 
AGGCATGAGC 
TTAGATGAAG 
CTCGGCTCAC 
CAGCCTCCTG 
ATTTTTGTGT 
TCTCGAACTC 
CTGGGATTAC 
GCATTTATTA 
CTGCCACCGA 
GCCTGGGGAT 
GCTTCCCCAA 
CCACATCCTG 
TTCCTTCAGG 
CAGGGAAGAG 
CCTTCACATC 
TCCTCTCAGC 
TCTGTTTTCT 
GCTGGGGGTT 
CCAGCCCCAC 
GCCCACAGGA 
CACCCTGCTC 
AGGTAGGCAC 
AGGGCTGGAG 
CAGATCTCTC 
CTGGGCACCG 
CACCTGCAGG 
CTGGAGCCTC 
GTGGCAGGAG 
CCCAGGTCAC 
GCCCATTTCT 
TGCTTGGACC 
TGACCTGCTG 
TCTGTGTGGA 
CCTCAGACGG 
AGACACAGAC 
ACCCATTCCG 
TGGGAAGGGC 
GGAGGATTGG 
CCCTGACTCC 
GCTGTGTTTT 
CCAGCAGGCA 
TGCTGCGTCT 
TCCTAGCAGA 
TGGGTCCTGG 
CCACCCTCCC 
CAAGGTAGAA 
GTGCTGTGAT 



CCTCTAGATA 

GCTGTTCTCA 

AGTCCTGAGC 

TCCTCACTGC 

TCCACAGCAA 

TTTGTTTTTT 

GTGCAGTGGC 

CCGATTCTCC 

CACTACTCCC 

CAATGTTGGC 

CCTCAGCCTC 

CCTCCCTTCC 

TCACTCTGTC 

AAGTTCCGCT 

AGCTGGGACT 

TAGTAGAGAC 

ACCTCGTGAT 

TGAGCCACCA 

GTAGTCCAGG 

GCCTCCTGGG 

ATGACAGGTG 

GATGGTGTTT 

GTGATCTGCC 

CACCAAGCCT 

TTTTGCTCTT 

TGCAATCTTT 

AGTAGCTGTG 

TTTTACTAGA 

CTGACCTCAG 

AGGTGTGAGC 

TCATGTCTTT 

GCGTCACCTC 

GATGCCCCTC 

CCTCCCCAGC 

TTCCCCTGAC 

CCCAGCCTCA 

TGAGTTCCCA 

CCCCGCTGCT 

AGGCAGGGAA 

CACCAATTAC 

TATAGAGTCA 

CTGGGGGTGG 

CAGCCACTCA 

AAGCTCTATG . 

AGGTAGGGGT 

GAGGCATGAG 

TCAGGCAGGG 

GGGCAGCTTG 

CTGCAGTGAG 

CGGTGCCTGC 

AGCAGGCTCG 

CTTTTTCTGT 

ACTGGAGGAC 

CTGGGTTCTG 

TCCTCATCTG 

GGGAGCTGGC 

AGCTCAGCAC 

ACCAGGATGG 

GGAGCCTGTA 

GGGGCCCTGC 

GGGTAGAATT 

CACCTTGGTA 

GAGACGGCGA 

CCTGTATTGC 

GGGCCTGGAG 

GGTATGCTTA 

CCCTCCTCCC 

ATCGGTCCTA 

AGAGGAGTGA 

GCCCCAGGAG 
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14001 AGTCCTAATC TAGGGAATGG GGTGGAGTAG GCAGATAATC CACCTCCCTA 
14051 TCCCCCAGGC AAGGGCGGAG CATGTGTCTT GGGCCCACAC CTGCTTAGTT 
14101 TATGAGGACC GGCTGCTTTC CAGTGGTAGC CCTTTTGCCA TGGAGGTCTG 
14151 GGAGAGAGAG CAGAGGGCGG CAGGGCTAAG TTGGTGATCA TTGGGTTCTT 
14201 CAGGACCTTC TATATCCCTC CTCGGTAACC CCCCAGCCCA ACCCCTTGGA 
14251 ATCTTTCCTC CAGGCTTCCT GAGAGCCCTG GGGGTGGGAG GCTGTGGGAG 
14301 GCTGTACATC TGAAATTCAC TTCAGTCCAA GTCATACCTA GGAAGCTGTC 
14351 TGGGCAGCTG CTCGAGGGAG GCCCTGGCTC TGATCCCAGG CTGGATGGAG 
14401 TGGCTGGAAG GAATGGTTCC AAACAACACC ACCGAGATCT CCCTCAGGCT 
14451 GGCCAGGTTT TGCAGCTGGA ATTCTCCTCT TGGTCCCAGG GCGGGGCAGG 
14501 GAATTCTAAG TGTCCACCCC AGGGAGGCAA GGGGCTGCTT TCCACTGTGG 
14551 GTACCTGGTG ATCAGGGCAA GCTGTGGAGG GCCAGGGGTG GGGCTGAGAC 
14601 TGGGCTGACA TCTAGAATCA CCTGCCACCT GGAGCCTCAG TAAAATGCCT 
14651 GGGGTCCCTG CTGCCTCTCA ATCTCCAGAG CCATGTCCAT GGGGAGGTGG 
14701 GCTCTGAAGG GCGAAGGTGG GAGAGCAGGG CCCCTGAGGC CTGGGTATCC 
14751 AAGGAGGGGC ACGTGCACCT GATTCTCCTT GGGGCCCAGA GGAAGCTGAT 
14801 GTCATGGCTG GACAAAGTCA CGGAGTAAAG CCAGCAAAGC CACCCTCTTC 
14851 CTGTGTAGTC CTTACAGGCA TGACTGGAAA GTTGGGGGGC ATCTATGGTA 
14 901 GACATGGCAC AGCCATGAAG AGACCAGTGG GGTGGTGCAG GGTGGACTTG 
14 951 GGGACCCTAC CCCTGAAGAC TGAGGCCCTG CAGCTACCAG GTGGGCTAGA 
15001 AGGTAACTGG AACAGGCCTG GGCACTTGTG CACCCATGTA GGAGCATGAG 
15051 GGCCACACTC TTTTCACCTC AAAGCCCTTG AAGAGTGGGC AAAGACAGCA 
15101 AGAGAGCTGC AGCCTGGGCC CGAGCTCAGA AACAGCTGTC GCCTCAGTCT 
15151 GCGCACAGGC ATGCACCCCA GGGTAGTGCC TGCAGGGATG CATGTGTCCC 
15201 CGTGGGGGTG CCTGTGCCAG GCAGGCCTCA GGTGCATGCC ATGCTCAGAA 
15251 CCCTGCTGCC CTTTCTAGGC AGCCTCCTTG GGGCCCAAGC TCTGCTCCCT 
15301 GGATCTGCCA CCTAGCAGAC GTGGGGAGCC TGACCCCATG CCTGTCATGG 
15351 AACCCTCCTT GCCTGGTGTG TGTGGCTCCC CTCTTCACTG GGCACCTGGA 
15401 TCCAGGCCCA CCTGTGTCCC TGACTCAGGG TGGTCCCAGG ACTGGCACCT 
15451 ACTCTTTAGA GAGCCCCAGC ATCTTTGATG TGGATTGGAG ACAATTGCCT 
15501 GGTTCCCTGG GGCAGGTGAA GACTTGGTGC CACAAAGAAT GCCACAGTGG 
15551 ATACGCCAGC AGGCCACATG GCTGGCCAAG CAATTATTAT TATGGATCCC 
15601 TTGGGCTGTG GGCCTTCCCA TCCACCCCAC CACAACTGCC CAGGTAGCTG 
15651 GAGCTGATCA TAAACAAGAA GGCTCTGGGC AGAGTCCATG GCACCAGCAC 
15701 CAGCCAAGGC CCACTCCTGA AGACCCGAAG CCCAGCCCCT GGATGAAGGT 
15751 CCTAAGGTCC TGAGGACTCC CCAGCCTGTG CAGGCCTGCA AACCCAGGCT 
15801 GCCCACAACA GAAGGGGCTC TCGGCTTGTC TGGCCTCTCT GGCCTCCCAA 
15851 GCAGGTGTGG GAGGGCGGGG CAAGTGTGGG CTGATCAGCT ACTCCATATG 
15901 GCCAGGGTCC TGTGCTGGTG CCTGGCTGGG GGGCTGCATA GCCTGCACTG 
15951 TCTCCTCCAG GCTGCCCCTG GGGAATACCA CGTAGTGTGT GGAGTTCAGC 
16001 CCTGGCAGCT CCCGCTGGTT CTCCTTGCTA TGCCGGATGC CATAGCCGAA 
16051 ATACACTGCA AGTCCTAGAC AGGGCAGGAG GCAGGGCATG AGCCTGAGGT 
16101 ACAGGTTCCA GCCCTTCCTG TCCTCTTTGC CCTCCTCCTG ACCCCGGTCC 
16151 CAGCCTGGCC CCCACTCACC CATCAGCAGC CAGATGGAGA AGCGCACCCA 
16201 GGTCAGATAG CTAAGTTTCA GCATGAGGCA GATGTTGAGG ACGATGCTCA 
16251 GGGCTGGAAT CAGGGGAACC ATGGGGATCT GAGGAGGCAG AGGCAGGGCA 
16301 GGGCTGGGCC GGGCTGCAGG AAAGATCTGC CAGCCCAGGG CTCACTTTCT 
16351 CGGGAATCCA TAGAGCCTTT GTTCCTCACG GGAGATTGTG GAGACATGTG 
16401 CTCACTCACC ATGCAGAAAG GGGTGCGGGA TGGGTGTGTG GTCCTCCCC 
(SEQ ID NO: 3) 
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Intron: 

Exon: 

Intron: 

Exon: 

Intron: 

Exon: 

Stop: 



12733- 
12845- 
12911- 
13086- 
13164- 
13254- 
13449 



12844 
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SNPS: 



DNA 



Position 


Major 


Minor 


Domain 




136 


C 


T 


Beyond ORF(5' 


') 


253 


T 


C 


Beyond ORF(5' 


') 


573 


C 


T 


Beyond ORF(5' 


') 


2000 


A 


G 


Beyond ORF(5' 


') 


2222 


G 


C 


Intron 




2783 


G 


T 


Exon 




3199 


G 


A 


Intron 




3307 


C 


G 


Intron 




5012 


C 


G 


Intron 




6169 


G 


C 


Intron 




7647 


A 


G 


Intron 




8638 


C 


T 


Intron 




9409 


T 


G 


Intron 




10504 


A 


C 


Intron 




10971 


T 




Intron 




12609 


G 


A 


Intron 




13367 


T 


A 


Exon 




14191 


T 


C 


Beyond ORF (3 ' 


) 


14227 


A 


G 


Beyond ORF (3 1 


) 


15027 


T 


C 


Beyond ORF (3 1 


) 


15441 


A 


C 


Beyond ORF (3 1 


) 



Protein 
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136 TCTCCAAGTCCATGGGTGCCTGGTAGGAGACAGGGGGATGAATGTGAACCCCTGCATGGC 
TATAGCCACCTGCCTCCTCCCCTGCCCTGCATCACTACCTGGCCTATTTTTTGCCTCTAG 
AAGCACTGCTTCCTA 
[C,T] 

GCTCCTTAGGACCACTGCCCGCATATGACAGATAAGAACATCGAGGCTAAGGCAACGCAA 
ATCTTTTCCTTAAAGTCATACAGCTGTCAAAAGAAAGCTGGACAACCTGGGCAACATAGC 
GAGATAAAAAATTATTTAAATTAGCCAGATGTGGTAGCCCCCTGTAGTCTCAGCGACTCA 
GGAGGCTGAGGCAGGAGGCTCACCAGAGTGCAGAGTTCAAGGATGCAGTGAGCTATGATC 

CTGCCACTGCACTGAAAGCTGGGTGACAGAGCAAGACCCTGGCTCTAATAAATGAATACA (SEQ ID NO: 5) 



2 53 TCTCCAAGTCCATGGGTGCCTGGTAGGAGACAGGGGGATGAATGTGAACCCCTGCATGGC 
TATAGCCACCTGCCTCCTCCCCTGCCCTGCATCACTACCTGGCCTATTTTTTGCCTCTAG 
AAGCACTGCTTCCTATGCTCCTTAGGACCACTGCCCGCATATGACAGATAAGAACATCGA 
GGCTAAGGCAACGCAAATCTTTTCCTTAAAGTCATACAGCTGTCAAAAGAAAGCTGGACA 
ACCTGGGCAACA 
[T,C] 

AGCGAGATAAAAAATTATTTAAATTAGCCAGATGTGGTAGCCCCCTGTAGTCTCAGCGAC 
TCAGGAGGCTGAGGCAGGAGGCTCACCAGAGTGCAGAGTTCAAGGATGCAGTGAGCTATG 
ATCCTGCCACTGCACTGAAAGCTGGGTGACAGAGCAAGACCCTGGCTCTAATAAATGAAT 
ACATAAAGTCTCACAGCTAGTGGTAGCTAATCCTGCCAGAGTCAGGCCTCTACCTGTCTG 

ATGAO^TGGCACACTATGTCTTTTAACCTGATTGCAGACCACAAATGTTTTGTGAATA (SEQ ID NO: 6) 



5 73 TAAATTAGCCAGATGTGGTAGCCCCCTGTAGTCTCAGCGACTCAGGAGGCTGAGGCAGGA 
GGCTCACCAGAGTGCAGAGTTCAAGGATGCAGTGAGCTATGATCCTGCCACTGCACTGAA 
AGCTGGGTGACAGAGCAAGACCCTGGCTCTAATAAATGAATACATAAAGTCTCACAGCTA 
GTGGTAGCTAATCCTGCCAGAGTCAGGCCTCTACCTGTCTGATGACAAATGGCACACTAT 
GTCTTTTAACCTGATTGCAGACCACAAATGTTTTGTGAATATTTTCCCCAGGGAAAAAAC 
[C,T] 
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GGAAGTAGTTCTAAATTCTATACATCCATTATATTAGTTTTACCTGTGGATTGGGAAAAC 
CCAGCTCTGATTGCATTTCAGGGCGGGACAGCCTTTGGTGCACTGTCTGGCGGGATTTTC 
CATTTTAACCTCCTTCTAGAAGCGCCTTCTCATGGTAAAGTTCCTGATGCCGCCAGGAGC 
GCCGAGGAGAGGGCAGGGGGCTGGAGACGCCCCGCAGAGGGCTACGTGCCCTGCTGGACA 

GAGGTCTCCTGCCTCCTCGGCGGCGCCAGCCCACCTCCCACAACCCCTGCGGGAGAAGCC (SEQ ID NO: 7) 

CTCCTCTCACAGCACTGATAACAGCTGTCCGTCTCCACCCTCCCACCACCTCCACTCCCA 
CCCCAGGAAGTGAGGCCAGAGGGCAGGGACAGAGCTGCTGCTGTTCTCTGTGTGCCAGGG 
CCCAGCAAAGGGAATGTAGGGAGGGTGGGAGGTGCAGGGCAGCTGGGATTAGGGGTTGAG 
GGCTGGGTGTTGGAGGCTGGATCTGGATCCTGCTTTAGTGGAAGTGTCCCTTTAACAGCA 
ACTGGCCTGGCCTGGCTCGGGCCCTGCTTTGCCTCCTGTTCAGCTGCGGCTGCAGCTGCC 
[A,G] 

TGCTGACTCATGTGCCCGCAGCTAGCAGGAGCTGGCAGCATGGGCTCCCCAGGGGCTACG 
ACAGGCTGGGGGCTTCTGGATTATAAGACGGAGAAGTATGTGATGACCAGGAACTGGCGG 
GTGGGCGCCCTGCAGAGGCTGCTGCAGTTTGGGATCGTGGTCTATGTGGTAGGGTAAGAG 
AGAAGAGCTTTTGGCCAGGCTGGAGGGGCAAGGGAAGAGGTGGGGGGTGGGGCTTGGTCC 

TGCTGGGTTGAAGTTGAGGGTTGGGCTGTTTAGGGGCTGGAGTGGAAGGGGGCAGATTGG (SEQ ID NO: 8) 

AGTGTCCCTTTAACAGCAACTGGCCTGGCCTGGCTCGGGCCCTGCTTTGCCTCCTGTTCA 
GCTGCGGCTGCAGCTGCCATGCTGACTCATGTGCCCGCAGCTAGCAGGAGCTGGCAGCAT 
GGGCTCCCCAGGGGCTACGACAGGCTGGGGGCTTCTGGATTATAAGACGGAGAAGTATGT 
GATGACCAGGAACTGGCGGGTGGGCGCCCTGCAGAGGCTGCTGCAGTTTGGGATCGTGGT 
CTATGTGGTAGGGTAAGAGAGAAGAGCTTTTGGCCAGGCTGGAGGGGCAAGGGAAGAGGT 
[G,C] 

GGGGGTGGGGCTTGGTCCTGCTGGGTTGAAGTTGAGGGTTGGGCTGTTTAGGGGCTGGAG 
TGGAAGGGGGCAGATTGGGACGGGGTTGGGGAGAGCTAGGCGATACAAGACAGGAGAGCA 
AGAACAAGCTGTGTGTTTGTCCTGTGTGTCCACTTGCCTCCTTCCCAGGCCCCCACCCAG 
GCCCCACCCAGGGGGCACATGACATAGTCCTTAACATCTGTGAGAGCTGGAGCACTAGGC 

CCCCAGAGAGACCACCAGCTGTATCTCGGGTCAGGAGAGTCTGTAAGGGGGAAGCTGGAT (SEQ ID NO: 9) 

GTATCTCGGGTCAGGAGAGTCTGTAAGGGGGAAGCTGGATCTAGTCAGGCTGGGGGTGGG 
TGCTGGCTAGTGAAGGTGATTGTCTGAGGGCATTGGCTCTCTGATGCATGGCTGGAGCTT 
CTGTCTCATTCAGGGGGTCTGGAGTGGGAAGTGGGGCCAGAGAGGAGGTGGGGCCTTCGA 
TGTTGGGCCGGGAGCCTGTAGGGTGTGGGGGGAGAACTGAGCATGTAGGGCTCAGCTCCG 
CCCCTGTCACTACACGCTGGGGACACACCACACTGCCCGACTTCTCCTCCCCAGGTGGGC 
[G, T] 

CTCCTCGCCAAAAAAGGCTACCAGGAGCGGGACCTGGAACCCCAGTTTTCCATCATCACC 
AAACTCAAAGGGGTTTCCGTCACTCAGATCAAGGAGCTTGGAAACCGGCTGTGGGATGTG 
GCCGACTTCGTGAAGCCACCTCAGGTGGGGGCCCTGATGTTGCTGACGGGGGCGCAAGTC 
CTTTCCCCACTGACAGCCTGAACACCCGCCATGCAGCCAGTGTGTGCGAGAGAGAAGCAT 

GTGATGCCAGAGACGGCTGCGGGTTCTCAGGAAGGGCTTCACAGAGGAGTGGCACCTGGA (SEQ ID NO:10) 

ATGTGGCCGACTTCGTGAAGCCACCTCAGGTGGGGGCCCTGATGTTGCTGACGGGGGCGC 
AAGTCCTTTCCCCACTGACAGCCTGAACACCCGCCATGCAGCCAGTGTGTGCGAGAGAGA 
AGCATGTGATGCCAGAGACGGCTGCGGGTTCTCAGGAAGGGCTTCACAGAGGAGTGGCAC 
CTGG ACAGG ACTTT CAGGG ATGTG T AGG AGGT TTTGGGGTGG AAAAAGGGG C C ACT C AAG 
AAGCCAGGCCAGGGTTGGACGTGCTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCC 
EG, A] 

AGGCAGGTGGATCACGAGATTGAGAGTATCCTGGCTAACACGGTGAAACCCCATCTCTAT 
TAAAAATACAAAAAATTAGCCGGGCATGGTGGTGGGCGCCTGTAGTCCCAGCTACTCGGG 
AGGCTGGGG CAGG AG AATGG CATG AAC C CGGGAGG TGG AG C TTGC AGTG AG C CG AG ATTG 
CACCACTGCACTCCAGCCTGGGTGGCAAAGCGAGACTCTGTCTCAAAAAAAAAAAAAAAA 

AG C CAGG C CAG AG AAACTG C ATTT C C AAAG ACTG C CAACAG AAAAG AAGGG AG TGTCCAG (SEQ ID NO: 11) 

GTGCGAGAGAGAAGCATGTGATGCCAGAGACGGCTGCGGGTTCTCAGGAAGGGCTTCACA 
GAGGAGTGGCACCTGGACAGGACTTTCAGGGATGTGTAGGAGGTTTTGGGGTGGAAAAAG 
GGGCCACTCAAGAAGCCAGGCCAGGGTTGGACGTGCTGGCTCACGCCTGTAATCCCAGCA 
CTTTGGGAGGCCGAGGCAGGTGGATCACGAGATTGAGAGTATCCTGGCTAACACGGTGAA 
ACCCCATCTCTATTAAAAATACAAAAAATTAGCCGGGCATGGTGGTGGGCGCCTGTAGTC 
[C,G] 

CAG CT ACT CGGGAGG CTGGGG CAGG AG AATGG CATG AAC CCGGGAGGTGG AG CT TG CAGT 
GAGCCGAGATTGCACCACTGCACTCC^GCCTGGGTGGCAAAGCGAGACTCTGTCTCAAAA 
AAAAAAAAAAAAAG C CAGG C CAG AG AAACTGCATTTCC AAAG ACTG CCAACAG AAAAG AA 
GGGAGTGTCCAGGACTAATGGCTTGAGCTTGAGAGTGGTGTGAGGTGCTGGGGCATGGAA 

CTTCCCTGTAGCCCTGCTCCCTGACCTGGGGCACTACGGTCAGGTGCTGCTCCTCCCCTC (SEQ ID NO: 12) 

TTAATGACTTGATGGGGCCAACATCCCTTCCCTCATAAACCAGGCTGCCGGCTTCCGGCC 
TTTCCAGTCAACACGAGCCCAGCCAGGCCAACCTTGAGACTTGCCTCCTAGGGAGAGAAC 
GTGTTCTTCTTGGTGACCAACTTCCTTGTGACGCCAGCCCAAGTTCAGGGCAGATGCCCA 
GAGGTGAGTTTACCCAGGATCCTCCCAGCGGGTCCCTTGTTCCTCCATCAGCCCCAGGTG 
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GCCACCCGTGTTTCCCTTTCCCCTTCCCAGGTGGCTGAAGGCTCAGCCTGTGCTCGGTGT 
[C,G] 

CCCCAGGCACTGGGCTACATCTTTTCCTGAATCATTATGTTCAGTCTTCACATATCCCCT 
GCCTGGTAGGAAGTCCTGTGATCCCCATTTCAGAGGAGAAGACTGAGGCTCAGTGAGGTT 
GAGTCACTTTCTTAAGGCCTCCAGGCCTGTGGGTGACAGGACCCCGAGCTCTGGGCAGCA 
GCAGTTCCCATGAGGTGTCCAGGCCCTCCCATCCTGGTCCTGCCTCTGGGTACTCTCCAG 

GTTGGTAGTGTGACACCCAGAGCTGCGCACATGCTCAGGGAGGTTCTAATAGCAAGAGCC (SEQ ID NO: 13) 

CTTGCCTGGGGATGTCCCTGGGATCCTGCATCTGTCACAGAGCATGCTCATTCTCTCCAG 
CTGTGAATTTTGTTTGAACTATTGGGACTCAGGACATAGTCCTGAAAGTTTACCTCCACA 
GTGACATCTTTAGGCAAGTCCAACATTTACGTGCCTCCTGGGCTGGAGGGTCGTTGTGCA 
GACAGCTGTCCCCTGAGCCCTGGTGGCTGGTCCTAGCACAGTTGCTGGAGACATCCCATG 
TCCGTAGTTGGAAATATGCACAAAGGATTGCTTACTCTTTTTGTTTGTTTGTTTTTTTGA 
[G,C] 

ATGGAGTCTTGCTCTTGTCCCCAAGGCTGGAGTTCAATGGCACGATCTCGGCTCACTGCA 
ACCTCCGCCTCCTGGGTTCAAGCAGTTCTCCTGCTCACCCCCTGAGTAGCTGGGATTACA 
GGTGCCCGCCACTGTGCCCAGCTAATTTTTGTATTTTAAGTAGAGACGGGGTTTCACCAT 
GTTGGCCAGGCTGGTCTCGAACTCCTGGCCTCAGGTGACCCACCAGCCTCGGCCTCTCAA 
AGTGCTGGGATTACAGGCGTGAGCCTGCCGAGAGCTTGGTCGGGGAGACCTGAACCCAGC 



(SEQ ID NO: 14) 



10504 



10971 



AGGTGGCAGGTCCGATGATGGGACAGAGGCTGTAGGTGGGGGACCTAGGGCTGCACTTGA 
GCAGAATCTTTTTTTTTTTTTTCTTTTTTTTTTTTTTGAGACAGAGTCTCGCTCTGTCAC 
CCAGGCTGGAGTGCAGTGGCGTGATCTCGGCTCACTGCACACCTCCACCTCCTTGGTTCA 
AGCGATTCTC CTGCCTC AGC CTCCC AAGTAGGTGGGACT AC AGGCACAC ACCAC CACACT 
CGGCTAATTTTTGTATTTTTAATAGAGACAGGGTTTTGCTGTGTCGGCCAGGCTGGTCTC 
[A,G] 

AACTCCTGACCTCAGGTAATCCGCCCACCTTGGCTTCTC7VAAGTGTTGGGATTACAGGTG 
TGCCAGGCCAAGCAGAATCTTAAAAAAAGGTGGGGAGAAGCTGGTGAGCAGGTGGATTTG 
GTTGAAGCAGGATGTCGACACAGAGGGGGCTTGGTGGGTAAAGGCCCTGAGCTGTGTGAG 
GTGAGGTGCCTTTAGGGCTACCTGCCACTGGGTGGAGCTGAAGTGAAGATTTGGACTGGG 
GTGGGAAGAAGGTAGTTCAGGATTTCAGGGGCCCCTGTAAGCCCCACTAAGGAGCTAAAC 

ACAAAGAAGCAGAGCATGTGGCTCTGCTCCGACCTCCACCCAATCACGACGGCCCTGTCT 
TTCAGAAAGTCCCACCGCCTCATTCTGGCTTCTCAGAGGCCCTCAGCCTTCCTTGCGCCC 
CTGGTGCTGGTGTTCTTCCTGCTGCCCCTGAGCTGAGTGCCCTGGGCAGCAGTGTCCATC 
CTCAGTTGGGGCAGGACCATGCCTGGGAGAGTGCCCGATGCTCAAGGGTGCCTTCGTCTC 
TGGGGTCTGGGACCCCAGAAAGCTCACCTGTCCTCCCCTTCTGCCAGAGCCCCATAGTCC 
[C,T] 

ATGCCTCTGTGCAGGCATTAATGTCCCCAGGTTACAGAAGAGCGAGCAGGAAGGAGTAGC 
CTGTGGTCCCTCAGCAAGGGTGTGGGGTCCTGCTTCAATACCCAAGCCCCTGACTCTAGG 
GCCCTGATCTTTGTCAGCTATGTCCCCATGCCGGGCATCAAAAACTCACCCTCCCAAGGT 
ATCTTCACCTTCCCTGATCTGTCATCCAAATTGGACCAGAGGAGCTAGACCTGGAAGAAT 
CACTTCCGCATCCACCAGGGACAGAACTGTCAGGAGGGAAGGGGCAGGGTGCGTTGTCTC 

TGAGGTGAGAGGATTGCTTAAGCCCGGGAGGGCGAGGCTGTAGTGAGCCATGATCATACC 
ACTGCACTAGAGCCTGGACAACAGAGTGAGACCGAATCACTAAAAATAAATTTTTTGAAA 
AAGGAGGAAAGGGGTCTCCCTTTGTCTTTGAAATACAGTACTGTACCTTCATCTGGCCAG 
GGCATTGCTCCGCTCCCTCCTCTGACCACCTCCTTTTATTTGCACCCTCCAGCTTTCCTG 
TGTGG C C C C ACACT C AGGG T ACT CTGG CGG CGGGG TGGTG AGGTTG T T T AAGGTGGG AAG 
[T,G] 

GGGCCTGTCCTTCCCACCTTGAACCTCCCTGCCTTTGAGACTGGGCTGTGGAGGGGAGAC 
ATCCCCTGTGCCATTGGTGACTGCTCTCTCTCCCACCTCAGCACCCGTCCGTCCCACTGG 
CTAACTGCTGGGTCGACGAGGACTGCCCCGAAGGGGAGGGAGGCACACACAGCCACGGTA 
ACTGTGGGCTCTGTCTTCCAGTGCCCCTAGCAGGGTGGGGGCCGGGCTGGGATCCTGGGT 
GGCTCCTGAGTGCAGGCCCTGCTCGCCTCTGTCCCTGCATCTCTCTTTCTGCCAACAACC 

GACCTCGTGGCCAAGGCTGGAGGGACCTTCGAGGACCTGGCGTTGCTGGTGGGTCCCAAG 
TTGGGGGCAGGGTTCCTAGAGGGCTCTGGGAGAGGGTCCCGGGCCCACCCACCGGTGGAA 
AAGCTATGTGCTATGTGCAGGGTGGCTCTGTAGGCATCAGAGTTCACTGGGATTGTGACC 
TGGACACCGGGGACTCTGGCTGCTGGCCTCACTACTCCTTCCAGCTGCAGGAGAAGAGCT 
ACAACTTCAGGTGAGGCCCCACTGCTCCCAGTGCCCAGCTGCTGGGCCCATCGCCCTCTC 
[A,C] 

CTGTGGCGGCCAGGACAGACCACACCCAGGCCCAGGCCTCTAGATATTCCACTACGTGTG 
CAAGGGGGTCCCAGGAGCAGGAGAGAGCTGTTCTCAACCCCACATCCTCCAGCACAGGCT 
CCGTCCTGCTGCCCCAAGTCCTGAGCCCTCCACCCCATCTGTCCCAGGCCCCTGCCCAGC 
TCAGGCTCCTCACTGCCAGCCCTTCCTCCACCCCACCTCGCTTCTAGTATCTCCCCTCCA 
CAGCAATGGGGTGTTTCATTTTTACTTTCCCCTTCTCCCCTTCAGCTTTGTTTTTTTTTT 

GGCCCCTGCCCAGCTCAGGCTCCTC^CTGCCAGCCCTTCCTCCACCCCACCTCGCTTCTA 
GTATCTCCCCTCC^CAGCAATGGGGTGTTTC^TTTTTACTTTCCCCTTCTCCCCTTCAGC 
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TTTGTTTTTTTTTTTTTAAGACAGAATCTCATTCTGTCACCCAGGCTGGAGTGCAGTGGC 
CCGACCTCGGCTCACTGTAACCTCTGCTTCCTGGGTTCAACCGATTCTCCTTCCTCAGCC 
TCCTGAGTAGCTGGAATTACAGGTGCTCGCCACTACTCCCAGCTAATTTTTATATTTTGG 
[T,-] 

AGATAGAGATGGGTTTTCACAATGTTGGCCAGGCTGGTCTCAAACCCCTGACCTCAGGTG 
ATCCACCCACCTCAGCCTCCCGAAGGGCTAGGATTACAGACGTAAACCACCATGTCTGGC 
CTCCCTTCCGCTTTTACCTAAACTTTTTTTTTTTTTTTGAGATGGAGTCTCACTCTGTCG 
CCCAGGCTGGAGTACAGTGGCGGGATCTCAGCTCACTGCAAGTTCCGCTTCCCGTGTTCA 
CGCCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTACGGGTGCACGCCTCCACGCC 



(SEQ ID NO: 19) 



CCAGGAAGTCTACCCCAGTTCCCAGGGAAGAGTGAGTTCCCATCTCTGGAATCCCTCAGC 
CCTGAGCCTGCCCCTTCACATCCCCCGCTGCTGGGTCTGTTTAGGGACTCCTCTGTCCCC 
CGTCCTCTCAGCAGGCAGGGAACTTCTGAGGGAC71GGTCTTCGTTTGCTTTTTCTGTTTT 
CTCACCAATTACATAGGGCTGAGACCCAGGACTCAGGCTTGGGCTGGGGGTTTATAGAGT 
CAATTG ACAAGTTGG ACAG AGG T CTGGC AGGG C C AG C CC C AC C TGGGGGTGGG CAAAGCA 
EG, A] 

GTCACCAGAGCCTTCTTTCCTGCCCACAGGACAGCCACTCACTGGTGGGAGCAACCGGGT 
GTGGAGGCCCGCACCCTGCTCAAGCTCTATGGAATCCGCTTCGACATCCTCGTCACCGGG 
CAGGTAGGCACAGGTAGGGGTCAGGCCGGGGATGGGATGGGGCAGGCAGACAGGGCTGGA 
GGAGGCATGAGGCTGACAGTCGTGGGCTGAGAGGTTCAGCTCAGATCTCTCTCAGGCAGG 

GAAGTTCGGGCTCATCCCCACGGCCGTCACACTGGGCACCGGGGCAGCTTGGCTGGGCGT ( SEQ ID NO : 2 0 ) 



13367 TTGGCCCTGCCTCTCCCAGGTCACCTTTTTCTGTGACCTGCTACTGCTGTATGTGGATAG 
AGAAGCCCATTTCTACTGGAGGACAAAGTATGAGGAGGTGAGCTGAGGTCGCTCTGCTTG 
GACCCTGGGTTCTGCCACACTTAGGAAGATGTTGGCTGGATCCCTGACCTGCTGTCCTCA 
TCTGCAGGCCAAGGCCCCGAAAGCAACCGCCAACTCTGTGTGGAGGGAGCTGGCCCTTGC 
ATCCCAAGCCCGACTGGCCGAGTGCCTCAGACGGAGCTCAGCACCTGCACCCACGGCCAC 
IT, A] 

GCTGCTGGGAGTCAGACACAGACACCAGGATGGCCCTGTCCAAGTTCTGACACCCACTTG 
CCAACCCATTCCGGGAGCCTGTAGCCGTTCCCTGCTGGTTGAGAGTTGGGGGCTGGGAAG 
GGCGGGGCCCTGCCTGGGGATCTCAAGGATGAGGCCCCAGCATGGAGGATTGGGGGTAGA 
ATTCCACCCTTGAACCCCAGCAGACAGTCCCTCCCCTGACTCCCACCTTGGTAGGGTGCT 

GCCTCAGGGAGCCATAGAAGTCGGCTGTGTTTTGAGACGGCGACAGAACCTGACCCGTGG (SEQ ID NO: 21) 



14191 ATCGGTCCTACATGGGGCTGTGCAGCTGGAGCCAAAAAGGCAAGGTAGAAAGAGGAGTGA 
TGGGGGAGGGGGATTGTTTCAGCTTCTCTGGTGCTGTGATGCCCCAGGAGAGTCCTAATC 
TAGGGAATGGGGTGGAGTAGGCAGATAATCCACCTCCCTATCCCCCAGGCAAGGGCGGAG 
CATGTGTCTTGGGCCCACACCTGCTTAGTTTATGAGGACCGGCTGCTTTCCAGTGGTAGC 
CCTTTTGCCATGGAGGTCTGGGAGAGAGAGCAGAGGGCGGCAGGGCTAAGTTGGTGATCA 
[T, C] 

TGGGTTCTTCAGGACCTTCTATATCCCTCCTCGGTAACCCCCCAGCCCAACCCCTTGGAA 
TCTTTCCTCCAGGCTTCCTGAGAGCCCTGGGGGTGGGAGGCTGTGGGAGGCTGTACATCT 
GAAATTCACTTCAGTCCAAGTCATACCTAGGAAGCTGTCTGGGCAGCTGCTCGAGGGAGG 
CCCTGGCTCTGATCCCAGGCTGGATGGAGTGGCTGGAAGGAATGGTTCCAAACAACACCA 

CCGAGATCTCCCTCAGGCTGGCCAGGTTTTGCAGCTGGAATTCTCCTCTTGGTCCCAGGG ( SEQ ID NO : 2 2 ) 



14227 AAGGCAAGGTAGAAAGAGGAGTGATGGGGGAGGGGGATTGTTTCAGCTTCTCTGGTGCTG 
TGATGCCCCAGGAGAGTCCTAATCTAGGGAATGGGGTGGAGTAGGCAGATAATCCACCTC 
CCTATCCCCCAGGCAAGGGCGGAGCATGTGTCTTGGGCCCACACCTGCTTAGTTTATGAG 
GACCGGCTGCTTTCCAGTGGTAGCCCTTTTGCCATGGAGGTCTGGGAGAGAGAGCAGAGG 
GCGGCAGGGCTAAGTTGGTGATCATTGGGTTCTTCAGGACCTTCTATATCCCTCCTCGGT 
[A,G] 

ACCCCCCAGCCCAACCCCTTGGAATCTTTCCTCCAGGCTTCCTGAGAGCCCTGGGGGTGG 
GAGGCTGTGGGAGGCTGTACATCTGAAATTCACTTCAGTCCAAGTCATACCTAGGAAGCT 
GTCTGGGCAGCTGCTCGAGGGAGGCCCTGGCTCTGATCCCAGGCTGGATGGAGTGGCTGG 
AAGGAATGGTTCCT^C^CACCACCGAGATCTCCCTCAGGCTGGCCTVGGTTTTGCAGCT 

GG AATTCT C CT CTTGG T C C CAGGG CGGGG CAGGG AAT TCT AAG TGT C CAC C CCAGGG AGG (SEQ ID NO: 23) 



15027 AGGGCCCCTGAGGCCTGGGTATCCAAGGAGGGGCACGTGCACCTGATTCTCCTTGGGGCC 
CAGAGGAAGCTGATGTCATGGCTGGACAAAGTCACGGAGTAAAGCCAGCAAAGCCACCCT 
CTTCCTGTGTAGTCCTTACAGGCATGACTGGAAAGTTGGGGGGCATCTATGGTAGACATG 
GCACAGCCATGAAGAGACCAGTGGGGTGGTGCAGGGTGGACTTGGGGACCCTACCCCTGA 
AG ACTG AGG C C CTGC AG CT AC CAGGTGGG CT AG AAGG T AACTGG AACAGG C CTGGG C ACT 
[T,C] 

GTGCACCCAT<3TAGGAGCATGAGGGCCAC^CTCTTTTCACCTCAAAGCCCTTGAAGAGTG 
GGCAAAGACAGCAAGAGAGCTGCAGCCTGGGCCCGAGCTCAGAAACAGCTGTCGCCTCAG 
TCTGCGGACAGGCATGCACCCGAGGGTAGTGCCTGCAGGGATGCATGTGTCCCCGTGGGG 
GTGCCTGTGCCAGGGAGGCCTCAGGTGCATGCCATGCTCAGAACCCTGCTGCCCTTTCTA 

GGCAGCCTCCTTGGGGCCCAAGCTCTGCTCCCTGK3ATCTGCCACCTAGCAGACGTGGGGA (SEQ ID NO: 24) 
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15441 GCCTCAGTCTGCGCACAGGCATGCACCCCAGGGTAGTGCCTGCAGGGATGCATGTGTCCC 
CGTGGGGGTGCCTGTGCCAGGCAGGCCTCAGGTGCATGCCATGCTCAGAACCCTGCTGCC 
CTTTCTAGGCAGCCTCCTTGGGGCCCAAGCTCTGCTCCCTGGATCTGCCACCTAGCAGAC 
GTGGGGAGCCTGACCCCATGCCTGTCATGGAACCCTCCTTGCCTGGTGTGTGTGGCTCCC 
CTCTTCACTGGGCACCTGGATCCAGGCCCACCTGTGTCCCTGACTCAGGGTGGTCCCAGG 
[A,C] 

CTGGCACCTACTCTTTAGAGAGCCCCAGCATCTTTGATGTGGATTGGAGACAATTGCCTG 
GTTCCCTGGGGCAGGTGAAGACTTGGTGCCACAAAGAATGCCACAGTGGATACGCCAGCA 
GGCCACATGGCTGGCCAAGCAATTATTATTATGGATCCCTTGGGCTGTGGGCCTTCCCAT 
CCACCCCACCACAACTGCCCAGGTAGCTGGAGCTGATCATAAACAAGAAGGCTCTGGGCA 

GAGTCCATGGCACCAGCACCAGCCAAGGCCCACTCCTGAAGACCCGAAGCCCAGCCCCTG {SEQ ID NO: 25) 

Chromsome map: 

Chromosome No: 22 
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